Rsc115: Sampling Bias and Logistic Models
Abstract
Biased sampling. It is worth re-stating the point made in section 3.3, using Meng’s notation in which $O_i = 1$ indicates that unit $i$ will volunteer if asked. We distinguish between the conditional distribution $p_n(x, y \mid o \equiv 1)$, given that a fixed sample of $n$ individuals happens to have no refusers, and the distribution $p_{o=1}(x, y)$ for the first $n$ volunteers. In a random-effects model where the responses for distinct units are correlated, these distributions are usually different. Longford will be disappointed to learn that the conditional distributions $p_n(y \mid x, o \equiv 1)$ and $p_{o=1}(y \mid x)$ are seldom equal either. Whether they are the same or different, it is the stratum distribution $p_{o=1}(x, y)$ that is relevant for volunteer samples.
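The distinction between the two distributions can be made concrete with a small exact calculation. The sketch below is a hypothetical toy model, not taken from the paper: a single shared random effect Z (an assumption made purely for illustration) makes both the willingness to volunteer and the response correlated across units, and the covariate x is omitted for brevity. All names and numerical values are illustrative.

```python
# Hypothetical toy model (not from the source): one shared random effect Z
# induces correlation between distinct units. Given Z, units are independent.
P_Z = {0: 0.5, 1: 0.5}          # prior distribution of the shared effect Z
P_VOLUNTEER = {0: 0.2, 1: 0.8}  # P(O_i = 1 | Z): unit i volunteers if asked
P_RESPONSE = {0: 0.1, 1: 0.9}   # P(Y_i = 1 | Z): binary response of unit i


def response_prob_fixed_sample(n):
    """P(Y_1 = 1 | O_1 = ... = O_n = 1) for a fixed sample of n units.

    Observing that all n sampled units volunteer is informative about Z,
    so the prior on Z is reweighted by P(O = 1 | Z) ** n.
    """
    weights = {z: P_Z[z] * P_VOLUNTEER[z] ** n for z in P_Z}
    total = sum(weights.values())
    return sum(weights[z] / total * P_RESPONSE[z] for z in P_Z)


def response_prob_first_volunteers():
    """P(Y = 1) for a unit among the first n volunteers.

    Recruitment continues until n volunteers are found, which happens with
    probability one for either value of Z, so the event carries no
    information about Z and the prior on Z is left unchanged.
    """
    return sum(P_Z[z] * P_RESPONSE[z] for z in P_Z)


if __name__ == "__main__":
    for n in (1, 2, 5):
        print(n, response_prob_fixed_sample(n), response_prob_first_volunteers())
```

Under these assumed values, conditioning on a fixed sample of two units with no refusers gives a response probability of about 0.85, whereas the first two volunteers have response probability 0.5. The gap arises only because the shared effect links volunteering and response; with independent units the two calculations would agree.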
Similar resources
Absent or undetected? Effects of non-detection of species occurrence on wildlife–habitat models
Presence–absence data are used widely in analysis of wildlife–habitat relationships. Failure to detect a species’ presence in an occupied habitat patch is a common sampling problem when the population size is small, individuals are difficult to sample, or sampling effort is limited. In this paper, the influence of non-detection of occurrence on parameter estimates of logistic regression models ...
Sampling Bias and Class Imbalance in Maximum-likelihood Logistic Regression
Logistic regression is a widely used statistical method to relate a binary response variable to a set of explanatory variables and maximum likelihood is the most commonly used method for parameter estimation. A maximum-likelihood logistic regression (MLLR) model predicts the probability of the event from binary data defining the event. Currently, MLLR models are used in a myriad of fields inclu...
Sampling bias and logistic models
In a regression model, the joint distribution for each finite sample of units is determined by a function $p_x(y)$ depending only on the list of covariate values $x = (x(u_1), \ldots, x(u_n))$ on the sampled units. No random sampling of units is involved. In biological work, random sampling is frequently unavoidable, in which case the joint distribution $p(y, x)$ depends on the sampling scheme. Regression mo...
Prediction of unwanted pregnancies using logistic regression, probit regression and discriminant analysis
Background: Unwanted pregnancy, not intended by at least one of the parents, has undesirable consequences for the family and the society. In the present study, three classification models were used and compared to predict unwanted pregnancies in an urban population. Methods: In this cross-sectional study, 887 pregnant mothers referring to health centers in Khorramabad, Iran, in 2012 were ...
Estimating Population Abundance Using Sightability Models: R SightabilityModel package
This introduction to the R SightabilityModel package is a slight modification of Fieberg (2012), published in the Journal of Statistical Software. Sightability models are binary logistic-regression models used to estimate and adjust for visibility bias in wildlife-population surveys (Steinhorst and Samuel 1989). Estimation proceeds in 2 stages: 1) sightability trials are conducted with marked in...
Publication date: 2008